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Abstract 

The application of paired-end next generation sequencing approaches has made it possible to systematically 
characterize rearrangements of the cancer genome to base-pair level. Utilizing this approach, we report the first 
detailed analysis of ovarian cancer rearrangements, comparing high-grade serous and clear cell cancers, and 
these histotypes with other solid cancers. Somatic rearrangements were systematically characterized in eight 
high-grade serous and five clear cell ovarian cancer genomes and we report here the Identification of > 600 
somatic rearrangements. Recurrent rearrangements of the transcriptional regulator gene, TSHZ3, were found 
in three of eight serous cases. Comparison to breast, pancreatic and prostate cancer genomes revealed that a 
subset of ovarian cancers share a marked tandem duplication phenotype with triple-negative breast cancers. The 
tandem duplication phenotype was not linked to BRCA1/2 mutation, suggesting that other common mechanisms 
or carcinogenic exposures are operative. High-grade serous cancers arising In women with germllne BRCA1 or 
BRCA2 mutation showed a high frequency of small chromosomal deletions. These findings indicate that BRCAT/2 
germline mutation may contribute to widespread structural change and that other undefined mechanlsm(s), which 
are potentially shared with triple-negative breast cancer, promote tandem chromosomal duplications that sculpt 
the ovarian cancer genome. 
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Introduction 

Somatically acquired structural genomic rearrange- 
ments are common in most solid cancers and are espe- 
cially so in ovarian cancer [1]. Ovarian cancer is a 
collective term for a number of distinctly different 
histotypes and tumours of varying degrees of malig- 
nancy [2]. The most common epithelial ovarian can- 
cer histotypes include serous, clear cell, mucinous and 
endometrioid. Amongst these, high-grade serous can- 
cers (HGSCs) are the most common, accounting for 



approximately 70% of all cases of invasive epithelial 
ovarian cancer and the majority of deaths. 

Increasingly comprehensive genomic analyses of 
HGSCs and clear cell cancers are providing insights 
into pathways of transformation and the molecular 
determinants of response to therapy. Our previous 
gene expression profiling of HGSCs identified four 
molecular subtypes [3], which are associated with 
different clinical outcomes [3,4]. A signalling pathway 
involving MYCN, LIN28B and LET7 is associated with 
one of the four subtypes [4], suggesting that specific 
pathway activation may drive the development and 
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behaviour of one or more HGSCs molecular subtypes. 
DNA copy number analyses have shown that gains and 
losses are particularly frequent in HGSCs [5,6], with a 
high level of genomic disorganization apparent in most 
cancer samples. A recent detailed genomic analysis 
of 500 HGSCs by the TCGA consortium identified 
frequently amplified and deleted regions of the HGSC 
genome [1]. Approximately 50% of HGSCs had defects 
in the BRCAl/2 pathway, either through germline or 
somatic mutation or methylation of pathway members. 

Clear cell cancers are associated with endometriosis 
[7], and have been recently found to harbour somatic 
mutations in the ARIDIA gene in approximately 50% 
of tumours [8,9]. The patterns of gene expression and 
copy number change seen in clear cell ovarian cancer 
are distinct from HGSCs, and frequently involve ampli- 
fication and over expression of cytokines including 
IL6, receptor tyrosine kinases and other downstream 
signalling components [10,11]. The gene expression 
profiles of ovarian clear cell cancers are similar to 
renal and uterine clear cell tumours [12]. Favourable 
responses have been observed in a small number of 
ovarian clear cell patients to sunitinib [10], a drug with 
considerable activity in renal clear cell cancer. 

Next-generation DNA sequence analysis is providing 
an unprecedented level of information about the can- 
cer genome, identifying new mutations [9,13,14], the 
impact of mutagens [15,16] and novel processes that 
sculpt the cancer genome [17]. Here we used paired- 
end DNA sequencing to seek novel gene fusions and 
characterize structural changes in ovarian cancer sam- 
ples, comparing and contrasting HGSCs and clear cell 
genomes. 



Materials and methods 

Patient samples and ethics 

Tumour samples and clinical data were obtained from 
women enrolled in the Australian Ovarian Cancer 
Study (www.aocstudy.org). All participants provided 
written informed consent and Human Research Ethics 
Committee approval was obtained at the Peter Mac- 
Callum Cancer Centre (Queensland Institute of Medi- 
cal Research, University of Melbourne, Australia) and 
all participating hospitals for the study. Further clini- 
cal data, information on biospecimens and microarray 
analysis are described in the Supplementary methods 
(see Supporting information). 

Next-generation sequencing 

Next-generation sequencing and structural variant anal- 
ysis was carried out as described previously [18]. 
Briefly, 37 bp paired-end reads generated on the 
lUumina GA2 were aligned to the human reference 
genome (hgl9) using BWA. Rearrangement break- 
points were called when two or more discordantly 
mapped read pairs supported the same underlying 

Copyright © 20 1 2 Pathological Society of Great Britain and Ireland. 
Published by John Wiley & Sons, Ltd. www.pathsoc.org.uk 



event. Breakpoints were classified according to the 
relative orientations and insert sizes of the read pairs 
into those suggesting deletion, translocation, inversion 
or tandem duplication (insertion). Candidate break- 
points were confirmed as somatic by PCR on both 
tumour and matched normal DNA and mapped to base 
pair resolution by capillary sequencing. Breakpoints 
were classified according to the relative orientations 
and insert sizes of the read pairs into those suggesting 
deletion, translocation, inversion or tandem duplica- 
tion (insertion). Rearrangements were further classified 
based on integration with copy number data to include 
amplicon junctions, fold-back inversions and genomic 
shards [19]. 

Validation of gene rearrangements 

cDNA from total RNA for sequenced samples and 
validation samples was synthesized using M-MLV 
reverse transcriptase (Promega), as described previ- 
ously [20]. Endpoint RT-PCR was performed accord- 
ing to standard protocols using Thermo-Start DNA 
polymerase (ThermoScientific). Products were resolved 
by agarose gel electrophoresis and samples with visi- 
ble products (for CCNY/CREM and ATP9B rearrange- 
ments only) were subjected to a second independent 
PCR reaction with alternate primer sets. No products 
were confirmed by a second PCR reaction except for 
control samples, where rearrangements were initially 
identified. Gene expression of TSHZ3 was measured 
by quantitative reverse-transcription PCR (qRT-PCR) 
normalized to ACTB and HPRTl control genes and 
median Ct (threshold cycle) values obtained across 
eight serous samples, using a SYBR Green qPCR assay 
on the 7900HT Fast Real-Time PCR system (Applied 
Biosystems, Foster City, CA, USA). RNA was treated 
with DNase I (Promega) prior to cDNA synthesis to 
eliminate background amplification of genomic DNA. 
Primer details are given in Table S4 (see Supporting 
information). 

Germline and somatic analysis of BRCA pathway 
dysfunction 

Complete germline sequencing of BRCAl and BRCA2 
and multiplex ligation-dependent probe amplification 
(MLPA) for detection of large chromosomal aberra- 
tions were undertaken in the National Association 
of Testing Authorities (NATA) accredited Molecular 
Pathology Laboratory at the Peter MacCallum Can- 
cer Centre. Detailed methods are provided elsewhere 
(Alsop et al, in press). Somatic mutations in tumour 
samples were screened by high-resolution melt (HRM) 
analysis of all coding exons of BRCAl and BRCAl, as 
described by others [21]. Gene promoter methylation 
of tumour DNA was assessed for BRCAl, FANCF and 
PALB2 using methylation-sensitive HRM, as described 
previously [22]. Reported percentage of methylated 
allele present in tumour DNA was estimated by com- 
parison to a panel of control samples. No promoter 
methylation of FANCF or PALB2 was identified in 
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the 13 samples screened, and only one sample showed 
promoter methylation of BRCAl (see Supporting infor- 
mation, Table SI). 



Results 

To identify DNA rearrangements in ovarian cancer 
genomes, we carried out paired end sequencing of 
tumour DNA from 13 ovarian tumour samples (eight 
HGSCs and five clear cell; see Supporting informa- 
tion, Table SI). A total of 1.1 x 10^ 37 bp read pairs 
generated 7 x 10'° base pairs that could be assigned 
uniquely to the reference genome equating to 4.3-16.5- 
fold physical coverage per sample. Putative structural 
rearrangements were identified from incorrectly map- 
ping read pairs [18,23] and 634 confirmed somatically 
acquired DNA rearrangements were identified across 
13 ovarian cancers (range 13-150/sample), with base 
pair sequence level resolution of 598 individual break- 
points (94%; see Supporting information. Table S2). 
Each genome displayed a different spectrum of unique 
somatically acquired rearrangements (Figure 1), with 
varying proportions of rearrangements in each class 
(Table 1). 

By classifying tumours according to the dominant 
class of rearrangement, three distinct mutation profiles 
were observed in HGSCs. In four of the HGSCs, half 
or more of the chromosomal rearrangements involved 



tandem duplications (median 410 kb), including one 
case with 81 tandem duplication events detected 
(PD3722a). For three HGSCs, deletions were the dom- 
inant rearrangement class, with the remaining serous 
cancer sample having a genome dominated by junc- 
tions within amplicons (Table 1). The three cases 
showing a high frequency of deletions had either 
a germline BRCAl or BRCA2 mutation (PD3724a, 
PD3726a and PD3731a) and the deletions in these 
cases were small (median 3.2 kb) when compared to 
deletions in cases without germline BRCAl 12 muta- 
tions (median 288 kb). Over-representation of deletions 
in BRCAl/2 mutant compared to wild-type cases as a 
proportion of all rearrangements was statistically sig- 
nificant (p < 3.91 X 10~" by a Poisson regression 
test). Two HGSCs cases with somatic alteration of 
BRCAl, either through point mutation or promoter 
hypermethylation (PD3722a and PD3725a, respec- 
tively), did not show the deletion/translocation pheno- 
type. Fewer rearrangements were observed in ovarian 
clear cell cancers compared with HGSCs (Table 1), 
consistent with our previous copy number analysis 
[10]. The types of rearrangement present within each 
genome were more evenly distributed between the 
classes in the clear cell histotype. Tandem duplications 
were the dominant class in all five clear cell cases, but 
relatively high proportions of translocations, deletions 
and inversions were also seen (Table 1). Statistical 
analyses utilizing a generalized linear model (GLM) 
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Figure 1. Genomic rearrangements of 13 ovarian cancer genomes. The chromosomes of the reference genome are drawn around the 
circumference of each circos plot [42]. Rearrangements are represented by lines linking somatically acquired breakpoints: orange, fold 
back inversions; light blue, one or both end map to an amplicon; purple, translocations; green, inversions; red, tandem duplications; dark 
blue, deletions. BRCAl/2 mutation status or mechanism of inactivation indicated. 
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Table 1. Breakpoint frequencies in 13 ovarian cancer cases illustrating correlations with BRCA1I2 status and histotype 
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Categories with the highest proportion of breaks for each tumour are in bold italics. 



approach indicated that the occurrence of deletions 
(p = 0.0053) and amplicons {p = 0.010) was signif- 
icantly different between clear cell and HGSCs cases. 
Further, the overall p value for the probability that the 
distributions differ (from the log likelihood, when com- 
pared against the null model of no interaction between 
cancer type and rearrangement class distribution) was 
3.13 X 10-^ 

We have previously observed high proportions of 
tandem duplications in a subset of breast cancer 
genomes [23]. Given the apparent similarity in pat- 
tern and distribution of this class of rearrangement, we 
undertook a comparative analysis of ovarian and other 
solid cancers for which data was available. Although 
HGSCs and clear cell cancers are considered to have 
different aetiologies and patterns of somatic mutation, 
we considered them as a single group for the pur- 
poses of the analysis, as tandem duplications were 
found in both histotypes. We compared 634 ovarian 
cancer rearrangements identified in this study, with 
994 rearrangements identified in primary breast can- 
cers [23], 352 rearrangements in pancreatic cancers 
[19] and 464 rearrangements recently reported in an 
analysis of seven prostate cancer genomes [24]. Com- 
paring the proportions of each class of rearrangement in 
each cancer type revealed significant differences (j^p 
value for effects of interaction between cancer type 
and rearrangement type = 1.88 x 10~^*). There were 
significantly more tandem duplications in breast and 
ovarian cancers compared to pancreatic and prostate 
cancers (p = 1.6 x 10~'^). Amplified fold back inver- 
sions were much more common in pancreatic than 
breast or ovarian cancers (p = 4.94 x lO"""*). 

In total, 371 (59%) of the somatic rearrangement 
breakpoints identified in this study fell in or within 
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10 kb of gene 'footprints' (ie within Ensembl gene 
coordinate boundaries), with 16 gene footprints dis- 
rupted in more than one sample (see Supporting infor- 
mation. Table S3). While these mostly occurred in 
genes with large genomic footprints, suggesting that a 
proportion are likely passenger events, several warrant 
further consideration. TSHZ3, a homeobox transcrip- 
tion factor that has been implicated in ureter formation 
[25] and development of respiratory neurons [26], was 
the most commonly disrupted gene, occurring in 3/8 
HGSC samples. The breakpoints (caused by transloca- 
tion, inversion or tandem duplication) all lie in the only 
intron and map within 29 kb of each other (Figure 2A). 
The gene is approximately 74 kb in length and we have 
not found it deleted in other cancer types previously 
(unpublished data). 

Rearrangement of TSHZ3 was further explored by 
two-colour FISH analysis of 90 HGSCs samples on 
tissue microarrays (TMA), which included three cases 
(PD3722a, PD3724a, PD2728a) identified in the rear- 
rangement screen. In total, 11/90 HGSC cases were 
found to have rearrangements involving the TSHZ3 
locus. The small tandem duplication in PD3722a was 
not detectable on TMA FISH, whilst the transloca- 
tion in PD3724a was readily confirmed. The inver- 
sion in PD3728a was not discemable on FISH; how- 
ever, the locus was amplified. In the remaining cases, 
there were two main patterns of rearrangement, split 
roughly equally between amplification and rearrange- 
ment breaks in the gene, including two cases with 
apparently balanced breaks, confirming TSHZ3 as 
recurrently rearranged (Figure 2B). 

Assessing the importance of TSHZ3 in ovarian can- 
cer is confounded by its proximity to CCNEl, mapping 
1.45 Mb telomeric on 19ql2 (Figure 2C). CCNEl is 

;Potho/ 20 1 2; 227: 446-455 
www.thejournalofpathology.com 



450 



DJ McBride et al 



(A) 



TSHZ3 

(chr19:36,457,691-36,532,030) 



(B) 



Chr 19 



41 kb inversion 



Z U0X3 



Exon 2 



Exon 2 



Exon 1 



Exon 1 fSkb tandem E^On 1 
^ duplication 

— ^z- 



•Ih 



Translocation 



Chr 1 




(C) 



o 
O 



(D) 



(E) 




Chr 19: 28-40 Mb 



o 

en g 
O '« 

Q_ "> 
1 0) 

cr 0 
c 
O 



2.0 
1.5 
1.0 
0.5 
0.0 



o 
o 

p< 0.60 





O 






■g 


2.0 










c 


1.5 


DC 


o 


O 


U) 




D. 


03 

2 


1.0 


h- 


Q. 


DC 


X 


0.5 


cr 


Q) 
Q) 




C 

Q) 


0.0 




(S 





E 

(0 



E 
< 



°p<0.05 



S CD 

ujf2 



"D 
0) 



CO 

DC 



p< 0.001 

— I 1 1 1 

5 6 7 8 

Exon 1 -2 
223392_s_at 



Figure 2. (A) Wild-type TSHZ3 locus (top) and schematics of the three rearrangements identified by next-generation sequencing. (B) Tissue 
microarray images of four cases with varying TSHZ3 rearrangements: (1) loss of the 5' end of TSHZ3 with example of locus without 
rearrangement indicated by a white arrow; (ii) balanced break; (iii) amplification; and (iv) breakage with amplification of the 3' end of 
TSHZ3. Green, BAC probes RP11-280H11 and RP11-241C16 (5' TSHZ3); red, RP11-161K19 and RP11-164011 (3' TSHZ3). (C) Affymetrix 
SNP 6.0 copy number data for tumour sample PD7231a, showing multiple focal amplifications incorporating CCNEl and TSHZ3. (D) Scatter 
plot of TSHZ3 expression by gene copy number status, where amplification is > three copies. (E) Scatter plot showing TSHZ3 gene 
expression by qRT-PCR (left) and gene expression microarray (right) in rearranged tumours (open circles) compared to samples without 
gene rearrangement (closed circles). 



known to be amplified and operative as a cancer gene 
in serous ovarian cancer [27], and the breakpoints in 
TSHZ3 could be collateral to amplification involving 
CCNEl. To further evaluate this, those TMA cases 
with TSHZ3 breakpoints and available DNA (n = 10) 
were evaluated using Affymetrix SNP 6.0 gene arrays 
for evidence of independence from CCNEl ampli- 
fication. Five of 10 cases had high-level amplifica- 
tion of CCNEl with associated gain and breaks in 
TSHZ3, a frequency substantially above the 15-20% 
seen in unselected cases [20]. Sample PD7231a shows 
a complex pattern of copy number change, includ- 
ing multiple, focal, high-level amplifications incorpo- 
rating both CCNEl and TSHZ3 (Figure 2C). In this 
case, TSHZ3 and CCNEl are on separate segments, 
although they are presumably co-amplified. Of the five 
cases without CCNEl amplification, four had evidence 
for low-level copy-number gain (copy numbers 3 or 
4) of both TSHZ3 and CCNEl, with TSHZ3 break- 
points detectable on SNP array in two of the cases; 
PD7229a and PD7730a both had balanced rearrange- 
ments on FISH and were not detectable on SNP array, 
as expected. In the remaining case, PD3722a, there was 



no copy number gain for TSHZ3 other than the small 
tandem duplication of the locus, whereas CCNEl had 
a single copy gain. These data suggest that TSHZ3 
rearrangement can occur at least in some instances 
independently of CCNEl amplification. Additionally, 
Affymetrix SNP 6.0 data obtained from the TCGA 
ovarian cancer study (see Methods) was also evalu- 
ated for independent breakpoints affecting TSHZ3 in 
the absence of CCNEl amplification. This analysis 
revealed five cases with evidence for TSHZ3 break- 
points apparently unrelated to CCNEl gain or ampli- 
fication. 

Quantitative RT-PCR analysis showed TSHZ3 expres- 
sion was not associated with gene amplification 
(Figure 2D); however, there was evidence of reduced 
expression of TSHZ3 in rearranged cases compared 
with HGSCs without disruption of the locus (Figure 2E). 
We also compared gene expression by microarray, 
using probe sets detecting transcripts across exon 1 
and 2 boundaries (223392_s_at) and within exon 2 
only (223393_s_at). No differences between probe sets 
were detected in rearranged versus wild-type samples, 
suggesting that there is no aberrant transcription of 
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short transcripts (Figure 2E). To further identify poten- 
tial fusion transcripts we employed 5' RACE across all 
eight sequenced samples. We were, however, only able 
to detect wild-type products (data not shown). If novel 
transcripts were generated in the rearranged samples, it 
is possible that low expression may have limited their 
detection. 

These findings suggest that if TSHZ3 is contributing 
to ovarian cancer, it may be through down-regulation 
or loss of function. However, arguing against this 
is the lack of truncating mutations reported in the 
recently released data from whole-exome sequencing 
of more than 250 serous ovarian cancers, where only 
two missense and two silent somatic mutations were 
identified [1]. 

We examined TSHZ3 gene expression further in two 
large independent datasets of HGSCs (see Supporting 
information. Figure SI). Although rearrangement of 
TSHZ3 is associated with reduced gene expression, we 
found that a high level of TSHZ3 was associated with 
shorter progression-free survival only in the AOCS 
dataset (p < 0.04) [3]. In both AOCS and TCGA [1] 
cohorts, high TSHZ3 expression was associated with 
the CI molecular subtype, which is characterized by 
intense tumour stromal and epithelial-mesenchymal 
transition gene expression signatures and poor clinical 
outcome. 

Non-identical tandem duplications of approximately 
1 and 0.5 Mb were identified in 2/8 HGSCs samples 
that disrupted the gene encoding Rho GTPase activat- 
ing protein 6 (ARHGAP6) on the X chromosome (see 
Supporting information. Table S3). These two cases 
were also rearranged for TSHZ3. ARHGAP6 functions 
as a GAP for RhoA GTPase, which has been implicated 
in cell motility, and angiogenesis and has been reported 
to be up-regulated in multiple cancers, including serous 
ovarian carcinomas [28]. Normally, ARHGAP6 would 
serve to limit the activity of RhoA by activating the 
intrinsic GTPase activity resulting in conversion to the 
inactive GDP-bound form of the enzyme [29]. The 
locus encodes multiple splice forms, but no signif- 
icant expression differences were discernable in the 
rearranged versus non-rearranged cases. 

Amongst the other genes broken, ARIDIA, a member 
of the SWI/SNF chromatin remodelling complex, was 
found to be presumptively inactivated by a ~130 kb 
inversion on chromosome 1 in the clear cell case, 
PD3753a. Frequent inactivating point mutations and a 
single rearrangement were recently reported in clear 
cell ovarian cancer [8,9], consistent with the inactivat- 
ing rearrangement identified here. Indeed, four of the 
five cases reported here were also included in Wiegand 
et al [9], with one case, PD3761a, having frameshift 
mutation in ARIDIA. Also, ARIDIA has been previ- 
ously implicated in other human cancers [30,31] and 
recently found to be mutated recurrently in clear cell 
renal cell carcinoma [32]. Additionally, breakpoints in 
PPP2R2B, a component of the regulatory subunit B 
of serine/threonine phosphatase 2 (PP2A) involved in 
cell growth and division, were found in two samples 
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(one serous and one clear cell tumour). Mutations in 
the PP2A regulatory subunit A component, PPP2RIA, 
have previously been shown in ovarian clear cell can- 
cers [8]. 

Although all 13 HGSCs and clear cell cases had rear- 
rangements, no recurrent gene fusions were detected 
amongst the discovery set of samples. Nine single 
instances of in-frame fusion genes and five in-frame 
internally rearranged genes were identified (Table 2), 
all occurring in HGSCs. To investigate whether any of 
these potential gene fusions and internally rearranged 
genes were transcribed, PCR assays were designed 
to the exons surrounding the genomic breakpoints 
and RNA from the relevant cases was assayed by 
reverse transcription (RT)-PCR. Products were suc- 
cessfully amplified for 7/9 gene fusions but none of 
the internally rearranged genes, indicating that a sub- 
stantial proportion of in-frame fusions were expressed. 
RT-PCR product sequence was verified by conven- 
tional sequencing. 

In order to detect recurrence of novel rearranged 
transcripts in ovarian cancer, we screened cDNA from 
an independent validation set of 109 HGSCs for the 
same exon joining events for six of the in-frame 
fusion genes found in HGSCs cases (CCNYICREM, 
NDUFAIIISTX17, AGGFI /SCAMPI, MAP3KI ISIRPA, 
C2oif67IMARCH4, CIIorf4I IDNAJC24) and one of 
the in-frame internally rearranged transcripts {ATP9B). 
We were unable to detect expression of any of the 
fusion transcripts in the validation cohort. We next 
looked for co-expression of CCNY/CREM in our 
previous analysis of ovarian tumours [3]. We rea- 
soned that oncogenic fusion transcripts would likely 
show gene over-expression and preselection of cases 
may increase the chance of detecting recurrent low- 
frequency fusions. We identified 25 samples from 
215 HGSCs (independent of the 109-sample valida- 
tion set) that showed high expression of both CREM 
and CCNY (expression above median +0.5 x median 
absolute deviation [MAD]), representing a statisti- 
cally significantly higher proportion than would be 
expected by chance (p < 0.05). From these, RNA was 
available for 15 samples and analysed for expres- 
sion of the CCNY/CREM fusion. No fusion prod- 
ucts were detected in the selected samples, suggest- 
ing that if co-expression was the result of rearrange- 
ment/fusion, breakpoints varied from the originally 
defined boundaries. 



Discussion 

The landscape of rearrangements in HGSCs and clear 
cell ovarian cancers is characterized by a substan- 
tial proportion of cases, showing a predominance of 
tandem duplications, overlapping a phenotype previ- 
ously shown in breast cancer [23]. In addition, there is 
evidence for a deletion/translocation phenotype poten- 
tially linked to germline BRCAI 12 mutations, with a 
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Table 2. In-frame gene fusions and internal gene rearrangements 
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•Alternative splicing of the fusion transcript yields both in-frame and out-of-frame products. ND = not determined. 



high prevalence of small deletions being particularly 
marked in BRCA2 null tumours. Somatic missense 
mutation and hypermethylation of BRCAl was not, 
however, associated with the same mutation phenotype. 
Therefore, somatic events are unlikely to be func- 
tionally equivalent to germline truncating mutations 
if abrogation of BRCAl/2 function is directly linked 
to the phenotype. Overall comparison between four 
different tumour types suggests that multiple differ- 
ent mechanisms are operative in the reconstruction of 
cancer genomes, likely involving as-yet unidentified 
exposures and/or genome maintenance defects. 

The observed high frequency of tandem duplications 
in the ovarian cancer samples, especially HGSCs cases, 
adds to previous data suggesting molecular similari- 
ties between triple-negative and HGSCs [33,34]. Of 
note, the tandem duplication phenotype was present 
in both histological subtypes of ovarian cancer. This 
is of interest, as these subtypes show marked differ- 
ences in cancer gene mutations found [35], expres- 
sion clustering [10] and response to therapy. Given 
the apparently different histogenesis [36], our findings 
suggest convergent acquisition of the tandem dupli- 
cation phenotype during oncogenesis. Further, tan- 
dem duplication was not associated with germline 
BRCAl or BRCAl, suggesting that the phenotype is 
attributable to a yet to be identified common patho- 
genetic mechanism. 

A recent study by Ng et al reported on the pres- 
ence of a tandem duplicator phenotype in HGSCs 
[37]. Genome paired-end sequencing of PE01/PE04 
cell lines derived from a BRCAl mutation carrier 
showed interchromosomal breakpoints and small dele- 
tions consistent with homologous recombination (HR) 
deficiency. In contrast, the PE014/PE023, BRCA 
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wild-type cell lines had intact HR and a tandem dupli- 
cator phenotype. Using SNP copy number data, a 
'tandem duplicator- like' (TD-like) pattern was inferred 
and used to estimate the phenotype frequency in the 
TCGA dataset. TD-like copy number aberrations were 
reported to occur in 12.8% of HGSCs and, consistent 
with our findings, these were mutually exclusive to 
BRCAl/1 carrier mutations. 

We found no evidence of recurrent fusion genes in 
our study. While less common than in haematologi- 
cal malignancies and sarcomas, recurrent gene fusions 
have been identified in other solid tumours, such 
as TMPRSSl-ERG in prostate [38] and EML4-ALK 
in non-small cell lung cancers [39]. Recently, Salz- 
man et al reported on the identification of the novel 
ESRRA-CllorflO gene fusion in serous ovarian can- 
cer, using an ultra-high-throughput RNA sequencing 
approach [40]. Recurrence was reported in approxi- 
mately 15% of cases {n = 67); however, we saw no 
evidence of this rearrangement in our screen. Identifi- 
cation of ESRAA-CllorflO fusions may have been 
limited by a low frequency of recurrence and the 
small number of serous tumours analysed in our 
study. 

TSHZ3 was identified as a target of recurrent break- 
age in ovarian cancer. Gene expression analysis sug- 
gests that if TSHZ3 is contributing to pathogene- 
sis, it may be through loss of function or simply 
down-regulation. Inactivation of TSHZ3 in vivo shows 
clear gene dosage effects, leading to haploinsuffi- 
ciency, neonatal lethality and reduced heterozygote pup 
size compared to wild-type siblings [26]. Additionally, 
TSHZ3 has recently been shown to be among the most 
down-regulated genes in breast and prostate cancer, 
suggesting likely relevance in various tumour types 
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[41]. Additional work is warranted to clarify mecha- 
nisms of deregulation and the role of TSHZ3 in ovarian 
cancer. 
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The following supporting information may be found in the online version of this article: 
Supplementary methods 

Figure SI. Analysis of TSHZ3 gene expression in AOCS and TCGA datasets. 
Table SI. Sample details. 

Table S2. Summary of somatic rean'angements found in 13 ovarian cancers. 
Table S3. Breakpoints in known genes. 
Table S4. Primer sequences. 

Table S5. Tumour cell content predicted by ASCAT and PICNIC. 
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